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<120> Methods and Materials for synthesis of organic products 
<130> MBHB00-1237- 



<140> 
<141> 



09/992,430 
2001-11-23 



<150> 
<151> 



60/252541 
2000-11-22 



<160> 65 

<170> Patentln version $.1 

<210> 1 

<211> 92 

<212> DNA 

<213>. Artificial sequence 
<220> 

<223> Multiple cloning sitel 




<400> 1 

cccaagcttg aattccccgg gggatccdtg cagggtacca cgcgtagatc tactagtgcg 



ccgcctcga gtctagaggg cccaagcttig gg 



2 

91 
DNA 

Artificial sequence 



60 
92 



<220> 

<223> Multiple cloning site 
<400> 2 

ccaagcttgg gccctctaga ctcgaggcgg ccg\:actagt agatctacgc gtggtaccct 
gcagggatcc cccggggaat tcaagcttgg g 



60 
91 



<210> 3 

<211> 31 

<212> DNA 

<213> Lactobacillus helveticus 



<400> 3 

ccgggatcca tggcaagaga ggaaaaacct c 



31 



1 




<210> 4 
<211> 32 
<212> DNA 

<213> Lactobacillus helveticus 
<400> 4 

ccaagatctt tattgacgaa ccttaacgcc ag 

<210> 5 
<211> 37 
<212> DNA 

<213> Pediococcus acidilactici 
<400> 5 

ccgggatcca tgtctaatat tcaaaatcat caaaaag 

<210> 6 
<211> 33 
<212> DNA 

<213> Pediococcus acidilactici 
<400> 6 

ccaagatctt tatttgtctt gtttttcagc aag 



<210> 7 
<211> 82 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 7 

taaacagtac aatcgcaaag aaaagctcca cacccaaacc aaataattgc aatgcaactt 
cttttctttt tttttctttt ct 



<210> 8 
<211> 79 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 8 

ttataaaatc attaaaatcc aaaatcgtaa tttatctctt tatcctctcc ctctctacat 
gccggtagag gtgtggtca 



<210> 9 

<211> 1736 

<212> DNA 

<213> Unknown 



<220> 



<223> Kanamycin resistance gene 
<400> 9 

gtacaacttg agcaagttgt cgatcagctc ctcaaattgg tcctctgtaa cggatgactc 60 

aacttgcaca ttaacttgaa gctcagtcga ttgagtgaac ttgatcaggt tgtgcagctg 120 

gtcagcagca tagggaaaca cggcttttcc taccaaactc aaggaattat caaactctgc 180 

aacacttgcg tatgcaggta gcaagggaaa tgtcatactt gaagtcggac agtgagtgta 240 

gtcttgagaa attctgaagc cgtattttta ttatcagtga gtcagtcatc aggagatcct 300 

ctacgccgga cgcatcgtgg ccgacctgca gggggggggg gggcgctgag gtctgcctcg 360 

tgaagaaggt gttgctgact cataccaggc ctgaatcgcc ccatcatcca gccagaaagt 420 

gagggagcca cggttgatga gagctttgtt gtaggtggac cagttggtga ttttgaactt 480 

ttgctttgcc acggaacggt ctgcgttgtc gggaagatgc gtgatctgat ccttcaactc 540 

agcaaaagtt cgatttattc aacaaagccg ccgtcccgtc aagtcagcgt aatgctctgc 600 

cagtgttaca accaattaac caattctgat tagaaaaact catcgagcat caaatgaaac 660 

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat 720 

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg 780 

attccgactc gtccaacatc aatacaacct ttaatttccc ctcgtcaaaa ataaggttat 840 

caagtgagaa atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca 900 

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 960 

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 1020 

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 1080 

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 1140 

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 1200 

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 1260 

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 1320 

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 1380 

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 1440 

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 1500 

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 1560 

ccccctgcag gtcggcatca ccggcgccac aggtgcggtt gctggcgcct atatcgccga 1620 

3 




catcaccgat 


ggggaagatc 


gggctcgcca 


cttcgggctc 


atgagcgctt 


gtttcggcgt 


1680 


gggtatggtg 


gcaggccccg 


tggccggggg 


actgttgggc 


gccatctcct 


tgcatg 


1736 


<210> 10 
<211> 372 
<212> DNA 

<213> Kluyveromyces marxianus 










<400> 10 
ccggttcttt 


ctcttactct 


tacaagacca 


agaacattgt 


cgaattccac 


tccgactaca 


60 


tcaaggtcag 


aaacgccact 


ttcccaggtg 


tccaaatgaa 


gttcgtcttg 


caaaagttgt 


120 


tgaccaaggt 


caaggatgct 


gctaagggtt 


acaagccagt 


tccagttcct 


cacgctccaa 


180 


gagacaacaa 


gccagttgct 


gactctactc 


cattgaagca 


agaatgggtc 


tggactcaag 


240 


tcggtaagtt 


cctacaagaa 


ggtgatgttg 


ttctaactga 


aaccggtacc 


tccgctttcg 


300 


gtatcaacca 


aacccacttc 


ccaaatgaca 


cctacggtat 


ctcccaagtc 


ttgtggggtt 


360 


ccattggttt 


ca 










372 


<210> 11 
<211> 747 
<212> DNA 

<213> Kluyveromyces marxianus 










<400> 11 
ttaccactgt 


cttcggtctg 


ccaggtgact 


tcaatctgcg 


tctgttggac 


gagatctacg 


60 


aggtcgaggg 


tatgagatgg 


gccggtaact 


gtaacgagtt 


gaacgcttct 


tacgctgccg 


120 


acgcttacgc 


cagaatcaag 


ggtatgtcct 


gtttgatcac 


caccttcggt 


gtcggtgagt 


180 


tgtccgcttt 


gaacggtatc 


gccggttctt 


acgctgagca 


cgtcggtgtc 


ttgcacattg 


240 


tcggtgtccc 


atccgtctcc 


gcccaggcca 


agcagctatt 


gttgcaccac 


accttgggta 


300 


acggtgactt 


cactgtcttc 


cacagaatgt 


ccgccaacat 


ctctgagacc 


actgctatga 


360 


tcactgatct 


agctaccgcc 


ccatctgaga 


tcgacagatg 


tatcagaacc 


acctacatta 


420 


gacagagacc 


tgtctacttg 


ggtttgccat 


ctaacttcgt 


tgaccagatg 


gtcccagcct 


480 


ctctattgga 


caccccaatt 


gacttggcct 


tgaagccaaa 


cgaccagcag 


gctgaggagg 


540 


aggtcatctc 


tactttgttg 


gagatgatca 


aggacgcuaa 


y aacccay ui- 


aULLiyyt uy 


600 

U \J \J 


acgcttgcgc 


ttccagacac 


gatgtcaagg 


ctgagaccaa 


gaagttgatt 


gacatcactc 


660 


agttcccatc 


tttcgttacc 


ccaatgggta 


agggttccat 


tgacgagaag 


cacccaagat 


720 


tcggtggtgt 


ctacgtcggt 


accttgt 








747 




<210> 12 
<211> 1738 
<212> DNA 
<213> Unknown 

<220> 

<223> kanamycin resistance gene fragment 
<400> 12 

gtacaacttg agcaagttgt cgatcagctc ctcaaattgg tcctctgtaa cggatgactc 60 

aacttgcaca ttaacttgaa gctcagtcga ttgagtgaac ttgatcaggt tgtgcagctg 120 

gtcagcagca tagggaaaca cggcttttcc taccaaactc aaggaattat caaactctgc 180 

aacacttgcg tatgcaggta gcaagggaaa tgtcatactt gaagtcggac agtgagtgta 240 

gtcttgagaa attctgaagc cgtattttta ttatcagtga gtcagtcatc aggagatcct 300 

ctacgccgga cgcatcgtgg ccgacctgca gggggggggg gggcgctgag gtctgcctcg 360 

tgaagaaggt gttgctgact cataccaggc ctgaatcgcc ccatcatcca gccagaaagt 420 

gagggagcca cggttgatga gagctttgtt gtaggtggac cagttggtga ttttgaactt 480 

ttgctttgcc acggaacggt ctgcgttgtc gggaagatgc gtgatctgat ccttcaactc 540 

agcaaaagtt cgatttattc aacaaagccg ccgtcccgtc aagtcagcgt aatgctctgc 600 

cagtgttaca accaattaac caattctgat tagaaaaact catcgagcat caaatgaaac 660 

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat 720 

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg 780 

attccgactc gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta 840 

tcaagtgaga aatcaccatg agtgacgact gaatccggtg agaatggcaa aagcttatgc 900 

atttctttcc agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca 960 

tcaaccaaac cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg 1020 

ttaaaaggac aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca 1080 

tcaacaatat tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg 1140 

gggatcgcag tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc 1200 

ggaagaggca taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg 1260 

gcaacgctac ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat 132 0 

cgatagattg tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa 1380 



5 




tcagcatcca 


tgttggaatt 


taatcgcggc 


ctcgagcaag 


acgtttcccg 


ttgaatatgg 


1440 


ctcataacac 


cccttgtatt 


actgtttatg 


taagcagaca 


gttttattgt 


tcatgatgat 


1500 


CL L-GL U» U- I- L. L>CL 


r*t~ t~crt"crcaat 

v_r ^> y ^ y \« u. n u> 


crtaacatcacr 


acrafc t 1 taacr 


acacaaccrtcr 


gctttccccc 


1560 


ccccccctgc 


aggtcggcat 


caccggcgcc 


acaggtgcgg 


ttgctggcgc 


ctatatcgcc 


1620 


gacatcaccg 


atggggaaga 


tcgggctcgc 


cacttcgggc 


tcatgagcgc 


ttgtttcggc 


1680 


gtgggtatgg 


tggcaggccc 


cgtggccggg 


ggactgttgg 


gcgccatctc 


cttgcatg 


1738 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerative amplification primers 

<400> 13 

gtbatyggyt chggtac 

<210> 14 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerative amplification primers 

<400> 14 

swrtcdccrt gytcacc 

<210> 15 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 

<400> 15 

gtacagttct ggatactgct eg 



<210> 16 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 



6 
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<400> 16 

acaggcatcg atgctgtc 



18 



<210> 17 
<211> 21 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 17 

ctacttggag ccactatcga c 21 



<210> 18 
<211> 19 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 18 

gtgatgtcgg cgatatagg 



<210> 19 
<211> 21 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 19 

gatctcctgc taagctcttg c 



<210> 20 
<211> 20 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 20 

gcagttttgg atattcatgc 



<210> 21 

<211> 972 

<212> DNA 

<213> Kluyveromyces thermotolerans 
<220> 

<221> CDS 

<222> (1)..(972) 

<223> 



<400> 21 
atg ttc caa 
Met Phe Gin 
1 



gat aca aag tct caa gca gta aga act gat gcc aaa aca 
Asp Thr Lys Ser Gin Ala Val Arg Thr Asp Ala Lys Thr 
5 10 15 



48 



gta aaa gtt 



gtg gta gtg gga gtg gga agt gtt ggg tct gcc aca gcg 



96 



7 




Val Lys Val 



tat acg ttg 
Tyr Thr Leu 
35 

gtg aac aaa 
Val Asn Lys 
50 

gca cct tea 
Ala Pro Ser 
65 

ggc gcg gec 
Gly Ala Ala 



caa aca agg 
Gin Thr Arg 



ate ate ccc 
lie lie Pro 
115 

gee acg aat 
Ala Thr Asn 
130 

ggg ttt cca 
Gly Phe Pro 
145 

get cgt ttt 
Ala Arg Phe 



age ate gat 
Ser lie Asp 



gtc tgg tct 
Val Trp Ser 
195 

gaa aaa gee 
Glu Lys Ala 
210 

gag caa acg 
Glu Gin Thr 
225 

act tea tat 
Thr Ser Tyr 



Val Val Val 
20 

ctt etc age 
Leu Leu Ser 



gac aaa gca 
Asp Lys Ala 



aat aca agg 
Asn Thr Arg 
70 

att gtt att 
He Val He 
85 

atg gat ctt 
Met Asp Leu 
100 

aat gtt gec 
Asn Val Ala 



cct gtc gat 
Pro Val Asp 



eta age aga 
Leu Ser Arg 
150 

aaa tac ate 
Lys Tyr He 
165 

gec tgt gta 
Ala Cys Val 
180 

ctt ace aac 
Leu Thr Asn 



aac cac ata 
Asn His He 



cga gac get 
Arg Asp Ala 
230 

gga ate gca 
Gly He Ala 



Gly Val Gly 
25 

ggc ate gtt 
Gly He Val 
40 

gag ggt gaa 
Glu Gly Glu 
55 

tct cga gcg 
Ser Arg Ala 



gtc aca tgt 
Val Thr Cys 



get gca aaa 
Ala Ala Lys 
105 

aaa tat get 
Lys Tyr Ala 
120 

gtt ttg acc 
Val Leu Thr 
135 

gtt ate ggc 
Val He Gly 



etc gga gag 
Leu Gly Glu 



att gga gaa 
He Gly Glu 
185 

ate gac ggc 
He Asp Gly 
200 

ttt gat cag 
Phe Asp Gin 
215 

get tac gat 
Ala Tyr Asp 



gcg gga tta 
Ala Gly Leu 



Ser Val Gly 



tec gag att 
Ser Glu He 



age atg gac 
Ser Met Asp 
60 

ggt gat tat 
Gly Asp Tyr 
75 

ggg att aac 
Gly He Asn 
90 

aat gee aac 
Asn Ala Asn 



cct gat acc 
Pro Asp Thr 



tat att age 
Tyr He Ser 
140 

tea ggt aca 
Ser Gly Thr 
155 

cac ttc aag 
His Phe Lys 
170 

cat ggt gat 
His Gly Asp 



atg aag etc 
Met Lys Leu 



aat gcg ttc 
Asn Ala Phe 
220 

ate ate aag 
He He Lys 
235 

ctt cgc ata 
Leu Arg He 



Ser Ala Thr 
30 

gtc ctt ate 
Val Leu He 
45 

tta aac cac 
Leu Asn His 



cct gac tgc 
Pro Asp Cys 



caa aaa aat 
Gin Lys Asn 
95 

att atg ctg 
He Met Leu 
110 

ate ctg ctt 
lie Leu Leu 
125 

tat aag gcg 
Tyr Lys Ala 



gtt ctg gat 
Val Leu Asp 



ate tea teg 
lie Ser Ser 
175 

teg ggt gtg 
Ser Gly Val 
190 

egg gat tac 
Arg Asp Tyr 
205 

cat aga ate 
His Arg He 



cgc aaa ggc 
Arg Lys Gly 



gta aag gcg 
Val Lys Ala 



Ala 



gac 144 
Asp 

gca 192 
Ala 



get 240 

Ala 

80 

gga 288 
Gly 



gaa 336 
Glu 



att 384 
He 



tea 432 
Ser 



act 480 

Thr 

160 

gac 528 
Asp 



cct 576 
Pro 



tgc 624 
Cys 



ttt 672 
Phe 



tat 720 

Tyr 

240 

att 768 
He 



8 




O 9932 ¥3 G . O 5 2 SO S 




245 



250 



255 



tta gag gat aca gga tec aca ctt aca gtt tea acc gtt ggt gat tat 
Leu Glu Asp Thr Gly Ser Thr Leu Thr Val Ser Thr Val Gly Asp Tyr 

260 265 270 

ttt ggg gtt gaa caa att get ata age gtc cct acc aaa etc aat aaa 
Phe Gly Val Glu Gin He Ala He Ser Val Pro Thr Lys Leu Asn Lys 
275 280 285 

agt ggg get cat caa gtg get gaa ctt tea etc gat gag aag gaa ata 
Ser Gly Ala His Gin Val Ala Glu Leu Ser Leu Asp Glu Lys Glu He 
290 295 300 

gaa ttg atg gaa aaa tea get agt cag ate aaa tea gtg att gag cat 
Glu Leu Met Glu Lys Ser Ala Ser Gin He Lys Ser Val He Glu His 
305 310 315 320 

ctg gag ate aat 
Leu Glu He Asn 



816 



864 



912 



960 



972 



<210> 22 
<211> 324 
<212> PRT 

<213> Kluyveromyces thermotolerans 
<400> 22 

Met Phe Gin Asp Thr Lys Ser Gin Ala Val Arg Thr Asp Ala Lys Thr 
15 10 15 



Val Lys Val Val Val Val Gly Val Gly Ser Val Gly Ser Ala Thr Ala 

20 25 30 



Tyr Thr Leu Leu Leu Ser Gly He Val Ser Glu He Val Leu He Asp 
35 40 45 



Val Asn Lys Asp Lys Ala Glu Gly Glu Ser Met Asp Leu Asn His Ala 
50 55 60 



Ala Pro Ser Asn Thr. Arg Ser Arg Ala Gly Asp Tyr Pro Asp Cys Ala 
65 70 75 80 



Gly Ala Ala He Val He Val Thr Cys Gly He Asn Gin Lys Asn Gly 

85 90 95 



Gin Thr Arg Met Asp Leu Ala Ala Lys Asn Ala Asn He Met Leu Glu 

100 105 110 



9 



lie lie Pro Asn Val Ala Lys Tyr Ala Pro Asp Thr lie Leu Leu lie 
115 120 125 



Ala Thr Asn Pro Val Asp Val Leu Thr Tyr lie Ser Tyr Lys Ala Ser 
130 135 140 



Gly Phe Pro Leu Ser Arg Val lie Gly Ser Gly Thr Val Leu Asp Thr 
145 150 155 160 



Ala Arg Phe Lys Tyr He Leu Gly Glu His Phe Lys He Ser Ser Asp 

165 170 175 



Ser He Asp Ala Cys Val He Gly Glu His Gly Asp Ser Gly Val Pro 

180 185 190 



Val Trp Ser Leu Thr Asn He Asp Gly Met Lys Leu Arg Asp Tyr Cys 
195 200 205 



Glu Lys Ala Asn His He Phe Asp Gin Asn Ala Phe His Arg lie Phe 
210 215 220 



Glu Gin Thr Arg Asp Ala Ala Tyr Asp lie lie Lys Arg Lys Gly Tyr 
225 230 235 240 



Thr Ser Tyr Gly He Ala Ala Gly Leu Leu Arg He Val Lys Ala He 

245 250 255 



Leu Glu Asp Thr Gly Ser Thr Leu Thr Val Ser Thr Val Gly Asp Tyr 

260 265 270 



Phe Gly Val Glu Gin He Ala He Ser Val Pro Thr Lys Leu Asn Lys 
275 280 285 



Ser Gly Ala His Gin Val Ala Glu Leu Ser Leu Asp Glu Lys Glu He 
290 295 300 



Glu Leu Met Glu Lys Ser Ala Ser Gin He Lys Ser Val lie Glu His 
305 310 315 320 



Leu Glu He Asn 



10 
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<210> 23 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerate amplification primers 



<400> 23 

gtyggtgchg gtgchgthgg 



20 



<210> 24 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerate amplification primers 

<400> 24 

swrtcdccrt gytcbcc 17 



<210> 25 

<211> 27 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> amplification primers 



<400> 25 

atccacaaca gcttacacgt tattgag 27 



<210> 26 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> amplification primers 



<400> 26 

gtttggttgc tggaagtggt gttgatag 28 



<210> 27 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 

<400> 27 



11 
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aacattgaat agcttgctca ggttgtg 



27 



<210> 28 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> amplification primers 
<400> 28 

gataataaac gcgttgacat ttcagatg 



28 



<210> 29 

<211> 939 

<212> DNA 

<213> Torulaspora pretoriensis 



<220> 
<221> 
<222> 
<223> 



CDS 
(1) 



. (939) 



<400> 29 

atg cat aga tgt get aaa gtg gec ate gtc ggt gec ggc caa gtt gga 
Met His Arg Cys Ala Lys Val Ala He Val Gly Ala Gly Gin Val Gly 
15 10 15 

tec aca aca get tac acg tta tta ttg agt agt ttg gtt get gaa gtg 
Ser Thr Thr Ala Tyr Thr Leu Leu Leu Ser Ser Leu Val Ala Glu Val 

20 25 30 

gtg ttg ata gat gtc gat aaa aga aag gtc gaa ggc caa ttt atg gat 
Val Leu He Asp Val Asp Lys Arg Lys Val Glu Gly Gin Phe Met Asp 
35 40 45 

ctg aac cac gcg get cct tta acg aag gag tea cga ttc agt get ggg 
Leu Asn His Ala Ala Pro Leu Thr Lys Glu Ser Arg Phe Ser Ala Gly 
50 55 60 

gac tat gaa agt tgt get gat get gcg gtt gta ate gta acg ggc ggg 
Asp Tyr Glu Ser Cys Ala Asp Ala Ala Val Val He Val Thr Gly Gly 
65 70 75 80 

get aat cag aaa cct ggt caa act aga atg gag eta gec gag agg aac 
Ala Asn Gin Lys Pro Gly Gin Thr Arg Met Glu Leu Ala Glu Arg Asn 

85 90 95 

gtt aaa ate atg cag gaa gtg ate cct aag att gtg aaa tac gee ccc 
Val Lys He Met Gin Glu Val He Pro Lys He Val Lys Tyr Ala Pro 

100 105 110 

aac gca att ttg ctg att gca aca aac cct gtc gat gta ctt ace tat 
Asn Ala He Leu Leu He Ala Thr Asn Pro Val Asp Val Leu Thr Tyr 



48 



96 



144 



192 



240 



288 



336 



384 



12 




aaa gtg gag gat tac tgt agg cag tec aag aga aag ttt gac ccc age 
Lys Val Glu Asp Tyr Cys Arg Gin Ser Lys Arg Lys Phe Asp Pro Ser 
195 200 205 

att ctg acc aaa ata tat. gag gag teg cgt gac gcg gca gee tac ate 
He Leu Thr Lys He Tyr„Glu Glu Ser Arg Asp Ala Ala Ala Tyr He 
210 215 220 

ata gaa cgc aaa ggc tat acc aat ttc ggg att gca gca ggt ttg get 
He Glu Arg Lys Gly Tyr Thr Asn Phe Gly He Ala Ala Gly Leu Ala 
225 230 235 240 

agg ata gtg aga get att ctg aga gat gaa ggt gee eta tta act gtg 
Arg He Val Arg Ala He Leu Arg Asp Glu Gly Ala Leu Leu Thr Val 

245 250 255 

tct act gta ggt gag cac ttt ggc atg aaa gat gtt tea ttg agt gtt 
Ser Thr Val Gly Glu His Phe Gly Met Lys Asp Val Ser Leu Ser Val 

260 265 270 

cca act agg gta gac agg age ggc get cac cat gtc gtc gac ctt ctg 
Pro Thr Arg Val Asp Arg Ser Gly Ala His His Val Val Asp Leu Leu 
275 280 285 

eta aac gac aag gag ctg gag caa att aaa aca tct gga gee aag ata 
Leu Asn Asp Lys Glu Leu Glu Gin He Lys Thr Ser Gly Ala Lys He 
290 295 300 

aag tea gee tgt gat gaa ctt ggc att 
Lys Ser Ala Cys Asp Glu Leu Gly lie 
305 310 



115 120 125 

get agt ttg aaa gcg teg gga ttc cca gca age egg gtt att ggt tct 432 
Ala Ser Leu Lys Ala Ser Gly Phe Pro Ala Ser Arg Val lie Gly Ser 
130 135 140 

ggg aca gtt etc gac tct get cgt ata cag cac aac ctg age aag eta 480 
Gly Thr Val Leu Asp Ser Ala Arg He Gin His Asn Leu Ser Lys Leu 
145 150 155 160 

ttc aat gtt tea tct gaa agt gtc aac gcg ttt att ate ggg gaa cat 528 
Phe Asn Val Ser Ser Glu Ser Val Asn Ala Phe He lie Gly Glu His 

165 170 175 

ggt gac tea agt gtg ccc gtc tgg teg ctt get gag att gee ggc atg 576 
Gly Asp Ser Ser Val Pro Val Trp Ser Leu Ala Glu lie Ala Gly Met 

180 185 190 



624 



672 



720 



768 



816 



864 



912 



939 



<210> 30 

<211> 313 

<212> PRT 

<213> Torulaspora pretoriensis 

<400> 30 
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Met His Arg Cys Ala Lys Val Ala He Val Gly Ala Gly Gin Val Gly 
1 5 10 15 



Ser Thr Thr Ala Tyr Thr Leu Leu Leu Ser Ser Leu Val Ala Glu Val 

20 25 30 



Val Leu He Asp Val Asp Lys Arg Lys Val Glu Gly Gin Phe Met Asp 
35 40 45 



Leu Asn His Ala Ala Pro Leu Thr Lys Glu Ser Arg Phe Ser Ala Gly 
50 55 60 



Asp Tyr Glu Ser Cys Ala Asp Ala Ala Val Val He Val Thr Gly Gly 
65 70 75 80 



Ala Asn Gin Lys Pro Gly Gin Thr Arg Met Glu Leu Ala Glu Arg Asn 

85 90 95 



Val Lys lie Met Gin Glu Val He Pro Lys He Val Lys Tyr Ala Pro 

100 105 110 



Asn Ala He Leu Leu He Ala Thr Asn Pro Val Asp Val Leu Thr Tyr 
115 120 125 



Ala Ser Leu Lys Ala Ser Gly Phe Pro Ala Ser Arg Val He Gly Ser 
130 135 140 



Gly Thr Val Leu Asp Ser Ala Arg He Gin His Asn Leu Ser Lys Leu 
145 150 155 160 



Phe Asn Val Ser Ser Glu Ser Val Asn Ala Phe He He Gly Glu His 

165 170 175 



Gly Asp Ser Ser Val Pro Val Trp Ser Leu Ala Glu He Ala Gly Met 

180 185 190 



Lys Val Glu Asp Tyr Cys Arg Gin Ser Lys Arg Lys Phe Asp Pro Ser 
195 200 205 



He Leu Thr Lys He Tyr Glu Glu Ser Arg Asp Ala Ala Ala Tyr He 
210 215 220 
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lie Glu Arg Lys Gly Tyr 
225 230 



Thr Asn Phe Gly lie Ala Ala Gly Leu Ala 

235 240 



Arg lie Val Arg Ala lie 

245 



Leu Arg Asp Glu Gly Ala Leu Leu Thr Val 

250 255 



Ser Thr Val Gly Glu His 

260 



Phe Gly Met Lys Asp Val Ser Leu Ser Val 
265 270 



Pro Thr Arg Val Asp Arg 
275 



Ser Gly Ala His His Val Val Asp Leu Leu 
280 285 



Leu Asn Asp Lys Glu Leu 
290 



Glu Gin lie Lys Thr Ser Gly Ala Lys lie 
295 300 



Lys Ser Ala Cys Asp Glu 
305 310 



Leu Gly lie 



<210> 31 
<211> 21 
<212> DNA 

<213> Bacillus megaterium 
<400> 31 

cctgagtcca cgtcattatt c 21 



<210> 32 

<211> 22 

<212> DNA 

<213> Bacillus megaterium 



<210> 33 

<211> 27 

<212> DNA 

<213> Bacillus megaterium 

<400> 33 

gctctagatg aaaacacaat ttacacc 27 

<210> 34 

<211> 28 

<212> DNA 

<213> Bacillus megaterium 



<400> 32 

tgaagctatt tattcttgtt ac 



22 



<400> 



34 
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atggatcctt acacaaaagc tctgtcgc 



28 



<210> 35 
<211> 26 
<212> DNA 

<213> Rhizopus oryzae 
<400> 35 

ctttattttt ctttacaata taattc 26 



<210> 36 

<211> 19 

<212> DNA 

<213> Rhizopus oryzae 



<210> 37 

<211> 29 

<212> DNA 

<213> Rhizopus oryzae 

<400> 37 

gctctagatg gtattacact caaaggtcg 29 

<210> 38 

<211> 30 

<212> DNA 

<213> Rhizopus oryzae 



<210> 39 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> cloning site sequence 

<400> 39 

aaatctagat gagccatatt caacggga 28 

<210> 40 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<400> 36 

actagcagtg caaaacatg 



19 



<400> 38 

gctctagatc aacagctact tttagaaaag 



30 



<220> 



16 




<223> cloning site sequence 
<400> 40 

ccggatcctt agaaaaactc atcgagcat 29 



<210> 41 
<211> 36 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 41 

gctctagaat tatgttccaa gatacaaagt ctcaag 36 

<210> 42 
<211> 34 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 42 

ccggaattca tcctcaattg atctccagat gctc 34 



<210> 43 
<211> 2229 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 43 

gcggccgcgg atcgctcttc cgctatcgat taattttttt ttctttcctc tttttattaa 60 

ccttaatttt tattttagat tcctgacctt caactcaaga cgcacagata ttataacatc 120 

tgcacaatag gcatttgcaa gaattactcg tgagtaagga aagagtgagg aactatcgca 180 

tacctgcatt taaagatgcc gatttgggcg cgaatccttt attttggctt caccctcata 240 

ctattatcag ggccagaaaa aggaagtgtt tccctccttc ttgaattgat gttaccctca 300 

taaagcacgt ggcctcttat cgagaaagaa attaccgtcg ctcgtgattt gtttgcaaaa 360 

agaacaaaac tgaaaaaacc cagacacgct cgacttcctg tcttcctatt gattgcagct 420 

tccaatttcg tcacacaaca aggtcctagc gacggctcac aggttttgta acaagcaatc 480 

gaaggttctg gaatggcggg aaagggttta gtaccacatg ctatgatgcc cactgtgatc 540 

tccagagcaa agttcgttcg atcgtactgt tactctctct ctttcaaaca gaattgtccg 600 

aatcgtgtga caacaacagc ctgttctcac acactctttt cttctaacca agggggtggt 660 

ttagtttagt agaacctcgt gaaacttaca tttacatata tataaacttg cataaattgg 720 

tcaatgcaag aaatacatat ttggtctttt ctaattcgta gtttttcaag ttcttagatg 780 

ctttcttttt ctctttttta cagatcatca aggaagtaat tatctacttt ttacaacaaa 840 
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tctagaatta 


tgttccaaga 


tacaaagtct 


caagcagtaa 


gaactgatgc 


caaaacagta 


900 


aaagttgtgg 


tagtgggagt 


gggaagtgtt 


gggtctgcca 


cagcgtatac 


gttgcttctc 


960 


agcggcatcg 


tttccgagat 


tgtccttatc 


gacgtgaaca 


aagacaaagc 


agagggtgaa 


1020 


agcatggact 


taaaccacgc 


agcaccttca 


aatacaaggt 


ctcgagcggg 


tgattatcct 


1080 


gactgcgctg 


gcgcggccat 


tgttattgtc 


acatgtggga 


ttaaccaaaa 


aaatggacaa 


1140 


acaaggatgg 


atcttgctgc 


aaaaaatgcc 


aacattatgc 


tggaaatcat 


ccccaatgtt 


1200 


gccaaatatg 


ctcctgatac 


catcctgctt 


attgccacga 


atcctgtcga 


tgttttgacc 


1260 


tatattagct 


ataaggcgtc 


agggtttcca 


ctaagcagag 


ttatcggctc 


aggtacagtt 


1320 


ctggatactg 


ctcgttttaa 


atacatcctc 


ggagagcact 


tcaagatctc 


atcggacagc 


1380 


atcgatgcct 


gtgtaattgg 


agaacatggt 


gattcgggtg 


tgcctgtctg 


gtctcttacc 


1440 


aacatcgacg 


gcatgaagct 


ccgggattac 


tgcgaaaaag 


ccaaccacat 


atttgatcag 


1500 


aatgcgttcc 


atagaatctt 


tgagcaaacg 


cgagacgctg 


cttacgatat 


catcaagcgc 


1560 


aaaggctata 


cttcatatgg 


aatcgcagcg 


ggattacttc 


gcatagtaaa 


ggcgatttta 


1620 


gaggatacag 


gatccacact 


tacagtttca 


accgttggtg 


attattttgg 


ggttgaacaa 


1680 


attgctataa 


gcgtccctac 


caaactcaat 


aaaagtgggg 


ctcatcaagt 


ggctgaactt 


1740 


tcactcgatg 


agaaggaaat 


agaattgatg 


gaaaaatcag 


ctagtcagat 


caaatcagtg 


1800 


attgagcatc 


tggagatcaa 


ttgaggatga 


attcggatcc 


ggtagataca 


ttgatgctat 


1860 


caatccagag 


aactggaaag 


attgtgtagc 


cttgaaaaac 


ggtgaaactt 


acgggtccaa 


1920 


gattgtctac 


agattttcct 


gatttgccag 


cttactatcc 


ttcttgaaaa 


tatgcactct 


1980 


atatctttta 


gttcttaatt 


gcaacacata 


gatttgctgt 


ataacgaatt 


ttatgctatt 


2040 


1— 4— ±~ 4_ A A A 1_ 4_ 4. 

ttttaaattt 


ggagttcagt 


gataaaagtg 


tcacagcgaa 


^* 4^ S*« 4« &m *"\ 

tttcctcaca 


tgcagggacc 


Z1UU 


gaattgttta 


caagttctct 


gtaccaccat 


ggagacatca 


aaaattgaaa 


atctatggaa 


2160 


agatatggac 


ggtagcaaca 


agaatatagc 


acgagccgcg 


gatttatttc 


gttacgcatg 


2220 


cgcggccgc 












2229 


<210> 44 
<211> 32 
<212> DNA 

<213> Candida sonorensis 










<400> 44 
tggactagta 


aaccaacagg 


gattgcctta 


gt 






32 
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<210> 45 
<211> 33 
<212> DNA 

<213> Candida sonorensis 
<400> 45 

ctagtctaga gatcattacg ccagcatcct agg 33 



<210> 46 

<211> 37 

<212> DNA 

<213> Candida albicans 



<210> 47 
<211> 36 
<212> DNA 

<213> Candida albicans 
<400> 47 

acttggccat ggtgatagtt attcttctgc aattga 36 



<210> 48 

<211> 20 

<212> DNA 

<213> Saccharomyces cerevisiae 



<210> 49 

<211> 20 

<212> DNA 

<213> Saccharomyces cerevisiae 

<400> 49 

ttaagccttg gcaacatatt 20 

<210> 50 

<211> 37 

<212> DNA 

<213> Candida albicans 



<400> 46 

gcgatctcga ggtcctagaa tatgtatact aatttgc 



37 



<400> 48 

tgtcatcact gctccatctt 



20 



<400> 50 

gcgatctcga ggtcctagaa tatgtatact aatttgc 



37 



<210> 



51 



19 
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<211> 
<212> 
<213> 



39 
DNA 

Candida albicans 



<400> 51 

cgcgaattcc catggttagt ttttgttgga aagagcaac 



39 



<210> 52 

<211> 32 

<212> DNA 

<213> Candida sonorensis 

<400> 52 

tggactagta aaccaacagg gattgcctta gt 32 

<210> 53 

<211> 33 

<212> DNA 

<213> Candida sonorensis 



<210> 54 

<211> 44 

<212> DNA 

<213> Candida sonorensis 

<400> 54 

ccggaattcg atatctgggc wggkaatgcc aaygarttra atgc 44 

<210> 55 

<211> 44 

<212> DNA 

<213> Candida sonorensis 



<223> primer that does not encode amino acid 
<220> 

<221> misc_feature 

<222> (21) . . (21) 

<223> n stands for any nucleotide 



<220> 

<221> misc_feature 
<222> (33).. (33) 

<223> n stands for any nucleotide 
<400> 55 

cgcggattca ggcctcagta ngaraawgaa ccngtrttra arte 44 



<400> 53 

ctagtctaga gatcattacg ccagcatcct agg 



33 



<220> 



20 
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<210> 56 
<211> 10 
<212> PRT 

<213> Candida sonorensis 
<400> 56 

Trp Ala Gly Asn Ala Asn Glu Leu Asn Ala 
15 10 



<210> 57 
<211> 10 
<212> PRT 

<213> Candida sonorensis 
<400> 57 

Asp Phe Asn Thr Gly Ser Phe Ser Tyr Ser 
15 10 



<210> 58 

<211> 18 

<212> DNA 

<213> Candida sonorensis 

<400> 58 

tctgttmcct acrtaaga 18 

<210> 59 

<211> 20 

<212> DNA 

<213> Candida sonorensis 



<210> 60 

<211> 36 

<212> DNA 

<213> Candida sonorensis 

<400> 60 

gcgatctcga gaaagaaacg acccatccaa gtgatg 36 

<210> 61 

<211> 68 

<212> DNA 

<213> Candida sonorensis 



<400> 59 

gtyggtggtc acgaaggtgc 



20 



<400> 



61 



21 
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tggactagta catgcatgcg gtgagaaagt agaaagcaaa cattgtatat agtcttttct 



60 



attattag 



68 



<210> 62 
<211> 34 
<212> DNA 

<213> Candida sonorensis 
<400> 62 

gcgatctcga gaaaatgtta ttataacact acac 34 



<210> 63 

<211> 75 

<212> DNA 

<213> Candida sonorensis 



<210> 64 
<211> 36 
<212> DNA 

<213> Candida sonorensis 
<400> 64 

gcgatctcga gaaagaaacg acccatccaa gtgatg 36 



<210> 65 

<211> 35 

<212> DNA 

<213> Candida sonorensis 



<400> 63 

tggactagta catgcatgcg gtgagaaagt agaaagcaaa cattttgttt gatttgtttg 



60 



ttttgttttt gtttg 



75 



<400> 65 

acttggccat ggtatatagt cttttctatt attag 



35 



22 



